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Abstract 

We study the evolution of large but finite asexual populations evolving 
in fitness landscapes in which all mutations are either neutral or strongly 
deleterious. We demonstrate that despite the absence of higher fitness 
genotypes, adaptation takes place as regions with more advantageous dis- 
tributions of neutral genotypes are discovered. Since these discoveries are 
typically rare events, the population dynamics can be subdivided into sep- 
arate epochs, with rapid transitions between them. Within one epoch, the 
average fitness in the population is approximately constant. The transi- 
tions between epochs, however, are generally accompanied by a significant 
increase in the average fitness. We verify our theoretical considerations 
with two analytically tractable bitstring models. 



1. Introduction 

Sudden bursts of adaptive activity which punctuate long periods of stagnancy 
seem to be a common observation in evolving systems. Such epochal evolution 
[van Nimwegen et al., 1997], has been found in the fossil record [Eldredge and 
Gould, 1972, Gould and Eldredge, 1977], in evolving bacteria [Lenski and Trav- 
isano, 1994, Elena et al., 1996], the evolution of tRNA structures [Huynen et al., 
1996, Fontana and Schuster, 1998] or artificial systems such as digital organ- 
isms [Adami, 1995] and evolutionary optimization [Vose and Liepins, 1991, van 
Nimwegen et al., 1997]. The most complete theoretical analysis of epochal evo- 
lution has probably been presented in a series of papers by van Nimwegen and 
coworkers [van Nimwegen et al., 1997, 1999b, van Nimwegen and Crutchfield, 
2000a,b]. The general picture is as follows. A population can easily climb to 
the nearest local optimum, but escape from there only with difficulty. Once a 
local optimum has been reached, the population is trapped and experiences a 
metastable equilibrium. With a relatively low probability, the population can 
discover a portal genotype leading to the next local optimum, i.e., a genotype 
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with a higher fitness than what is currently present in the population. Once a 
portal is discovered, the population moves quickly away from its current peak, 
and towards the new peak. There, it settles down again in equilibrium, until the 
next portal is discovered. 

The above description focuses on the dynamics between local optima, but not 
on the dynamics at one local optimum. However, in the presence of neutrality, 

1. e., when a number of genotypes share the same identical replication rate, the 
dynamics around one local optimum can be quite intriguing. A population does 
not drift over a set of neutral genotypes, like a set of random walkers, but has 
a quite different, even to some extent deterministic dynamics. In a completely 
flat fitness landscape, for example, a population does not assume a Gaussian 
distribution with a variance that increases over time, as would be expected from 
a simple diffusion process. Rather, the population stays clustered together and 
the cluster moves about as a whole [Derrida and Peliti, 1991]. If neutral and 
deleterious genotypes are mixed, i.e., if each genotype has as direct neighbors 
in genotype space both neutral and deleterious genotypes, then the population 
moves to the regions of the most connected neutral genotypes, as long as the 
mutation rate is small [van Nimwegen et al., 1999a]. 

In this paper, we study the dynamics of a finite but large population of asex- 
ually replicating genetic sequences on a fitness landscape that contains both 
neutral and strongly deleterious genotypes. We show that the neutral genotypes 
naturally decompose into disjunct sets, and that an evolving population can be 
trapped within such sets. Moreover, different sets yield different reproductive 
success for the populations residing on them. When a population discovers a set 
of genotypes with higher reproductive success, the population moves over to that 
set. For the average fitness of the population, such transitions are reflected in a 
stepwise increase, exactly as it is observed in standard scenarios of epochal evo- 
lution. However, the increase observed here is not due to the discovery of faster 
replicating genotypes, but solely to the discovery of genotypes with increased 
robustness against mutations. Our results are valid for arbitrary mutation rates, 
and they generalize the previous findings of van Nimwegen et al. [1999a]. 

2. Theory 

We assume that there exist two classes of genotypes, those with a relatively high 
replication rate a, and the ones with a much lower replication rate. For reasons 
of simplicity, we assume the latter to have replication rate 0. This assumption 
is quite common in the literature [Gavrilets, 1997, 1999, van Nimwegen et al., 
1999a]. Our analysis is similar to the standard treatment of the quasispecies 
model (see for example Schuster and Swetina [1988], Eigen et al. [1988, 1989], 
Wilke et al. [2001]) and to the work of van Nimwegen et al. [1999b, a]. Let the 
vector a define the set of neutral genotypes, i.e., the genotypes with replication 
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rate o: 

1 if i has replication rate a, , . 

else, ^ ' 

where i runs over all possible genotypes. We assume a discrete time model, 
and write the average population fitness as (/). In equilibrium, we have [van 
Nimwegen et al., 1999b] 

x = ^QAx, (2) 

where x is the vector of concentrations, the diagonal matrix A = diag(ao, a-i, ■ ■ ■ ) 
contains the set of neutral genotypes, and the matrix Q defines the mutation 
probabilities between different genotypes, i.e., genotype j mutates into genotype 
% with probability Qij. 

In the following, we assume that the genotypes can be represented as sequences 
of length / over an alphabet of A different symbols. Moreover, we assume a 
uniform copy fidelity q per symbol. That means, the / symbols in a sequence 
mutate independently from each other, and the substitution probability in one 
generation is 1 — q for each symbol. With this assumption, the mutation matrix 
can be written as [Swetina and Schuster, 1982] 

/ 1 _ a \ d(i ' j) 

where d(i,j) is the Hamming distance between sequences i and j. It is useful to 
introduce the reduced mutation rate /2, 

* = i& (4) 

which allows us to write Q as a sum of matrices, 

Q = q l J2^ k K (5) 

fc=0 

The matrices define the connection graphs at Hamming distance k in se- 
quence space, i.e., 

(fc ) / 1 if d(i,j) = k, 

v \ else. W 
We insert (5) into (2), and obtain 

* = ^E^ D(fc)A - ( 7 ) 
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It is useful to introduce the matrices 



(8) 



These matrices define the connection graphs at Hamming distance k for the neu- 
tral genotypes. We now disregard all non-neutral sequences, and introduce the 
concentration vector p, which holds the concentrations of all neutral sequences. 
The total number of neutral sequences is then P = J2iPi- Moreover, we assume 
all columns and rows corresponding to non-neutral sequences to be deleted from 
the matrices G^. From equation (7), we obtain the eigenvalue equation 



Consequently, the equilibrium state of the population is fully determined by the 
matrix 



In the following, we will call matrices such as G generalized connection matrices. 
The difference to the normal connection matrices G^ as defined in equation (8) 
is that generalized connection matrices may contain powers of jl, i.e., they define 
a connection graph with weighted edges, whereas for the graphs defined by G^, 
all edges have the same weight. 

We can obtain further insight from relating the average population fitness 
(/) to the average fraction of neutral offspring, (u). Under the assumption that 
the non-neutral sequences have a vanishing replication rate (this assumption is 
equivalent to neglecting back mutations), we have [van Nimwegen et al., 1999a] 



The probability v, L for a single sequence % to have a neutral genotype as offspring 
is given by 





G = £ £* -1 G (fc) 



= G«+/iG( 2 )+/i 2 G( 3 )... 



(10) 




(11) 




3 




k=0 



3 




(12) 
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where d\® gives the number of neutral neighbors at Hamming distance k of 
sequence %. When we take the average over all viable sequences in the population 
on both sides of equation (12), we arrive at 

The quantities (d^) give the average number of neutral neighbors at Hamming 
distance k in the population. They can be expressed as 



» 3 

With equation (11), we can rewrite equation (13) as 

£/i*<dW> = <Z>_l. (15) 

k=l a 1 

We notice that the right- hand- side of (15) is identical to the factor in front of p 
on the left-hand-side of (9). Therefore, we arrive at 

J2^- l (d {k) ))p= fo^GwV (16) 
yk=i J \k=i J 

In the limit of a small mutation rate, (16) becomes 

(d^)p = G {1) p. (17) 

In that case, the equilibrium distribution p depends solely on the one-mutant 
connections of the neutral sequences, and the population neutrality (d^) is 
given by the spectral radius of the first-order connection matrix. Equation (17) 
corresponds to the result of van Nimwegen et al. [1999a]. Thus, we find that 
equation (16) is the generalization of that result to arbitrary mutation rates. For 
larger mutation rates, the equilibrium distribution is influenced by the higher 
order connection matrices, and the population neutrality may deviate from the 
spectral radius of G^. 



2.1. Separate neutral networks 

Following the conventional nomenclature in the literature [Forst et al., 1995], we 
call a set of neutral sequences that can be transversed by one-point mutations 
a neutral network. The full set of neutral sequences in genotype space will in 
general decompose into several disjunct such neutral networks. Assume there 
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are n disjunct neutral networks. By reordering sequences, we can arrange G in 
block-matrix form 



G 



/Gi \ 

G 2 0{p) 
G 3 



(18) 



where the matrices G, are the generalized connection matrices for the different 
neutral networks, and all off-diagonal terms are at least of the order of jx. Let 
us further assume that the matrices Gj are ordered with descending spectral 
radius, i.e., the spectral radius of G« is larger than the one of Gj+i for all i. 
For a finite population initially residing on network n, we can then expect the 
following dynamics. If the mutation rate is small, the population will equilibrate 
within network n. The discovery of a sequence that is part of a neutral network 
% < n is very unlikely, due to the small off-diagonal terms. However, eventually 
such a sequence will be discovered. Since a progeny of that sequence has a smaller 
probability to fall off its neutral network (Gj has a larger spectral radius than 
G n ), the sequences on network i will have a higher reproductive success than 
the sequences on network n, and the population will move over to the newly 
discovered network. There, the population will equilibrate, until the next higher- 
connected network is discovered. If the mutation rate is large, on the other hand, 
the off-diagonal terms cannot be considered small anymore. In that case, the 
discovery of higher connected regions is much more likely, and a population will 
move straight away into the more densely connected regions of the genotype 
space. In Sec. 4, we present examples for both of these behaviors. 

The above considerations show that the decomposition of the full set of neutral 
sequences into neutral networks according to the definition given at the begin- 
ning of this subsection is somewhat arbitrary. Depending on the mutation rate, it 
might be justified, for example, to disregard contributions of order 0(£i 2 ) to the 
matrix G, but not the ones of order 0{p). In that case, it would be more natural 
to group the sequences into sets that can be transversed by a combination of 
one- or two-point mutations. In general, the neutral sequences should therefore 
be subdivided into sets such that two arbitrary sequences of two disjunct sets are 
at least a Hamming distance k apart, where k represents the smallest number 
of simultaneous mutations that can be considered rare at the respective muta- 
tion rate. For the remainder of this paper, we will understand the term neutral 
network in this more general sense. 



3. A simple exactly solvable landscape 

In this section, we study a simple example landscape for which the matrix G 
can be diagonalized exactly, to all orders in p,. We consider binary sequences 
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of length / = 2n, and break them down into n pairs of bits. For each pair, we 
assume that there are three states (00, 01, 10) which are neutral, and one state 
(11) which is lethal. Therefore, a sequence which contains at least one pair for 
which both bits are set to 1 has a fitness of 0, and all other sequences have a 
fitness of a. For simplicity, we set a — 1. We will refer to this landscape as the 
Neutral Bitpairs landscape. 

For a single pair, the matrix Gi reads (in this section, the subscript % in Gj 
indicates the number of pairs we are considering) 



Gi = 1 fi , (19) 




and its largest eigenvalue is Ai = (p,+ a/8 + n 2 )/2. For a sequence with 2 pairs, 
the corresponding matrix G2 can be written as a tensor product (see Rumschitzki 
[1987], Dress and Rumschitzki [1988], Wilke [1999]), 

G 2 = /T 1 [(l + /iGi)®(l + /iGi)-l] . (20) 

The symbol 1 stands for the identity matrix in the appropriate matrix space. 
In general, for a sequence consisting of n pairs, we can define the matrix G n 
recursively, 

G n = /T 1 [(1 + /2G0 ® (1 + ^G n _x) - 1] . (21) 
As a consequence, the largest eigenvalue of G n reads 
A n = /i- 1 [(l + /iAi) n -l], 

= ^ 1 [(l + Y + 7 ]v / 8T^) n -l]- (22) 

The average population fitness in this landscape follows from equation (9). We 
obtain 

</> = ^(l + y + £>/8 + £T- (23) 

For small mutation rates, we can write the average fitness as 

(f} = q 2n (l + yftnji), (24) 

and the average population neutrality becomes 

(S 1) ) = V2n. (25) 

Let us now compare the full solution for the average population fitness to the 
approximation given by van Nimwegen et al. [1999a], 
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spectral radius of G' 1 ' 
• • • • 



network neutrality v 



0.1 1 10 

genomic mutation rate fj, 

Figure 1: a: Average fitness as a function of the mutation rate in a Neutral Bitpairs landscape 
with n = 10. The solid line represents equation (23), the dotted line is the approximation 
equation (26) with (d^) given by equation (25), and the points stem from simulations with 
a population of size N — 10000. b: Average population neutrality (d^) as a function of the 
mutation rate for the simulations from graph a. The solid line indicates the spectral radius of 
the first-order connection matrix and the dotted line gives the network neutrality v = |Z. 

where \i is the genomic mutation rate, 

H = l(l-q). (27) 

Equation (26) follows from equation (24) if we disregard all terms of the order 
C?((l — g) 2 ) or higher. Figure la shows the exact solution for the average fitness 
(/) equation (23) and the approximation equation (26) as a function of the 
genomic mutation rate \i. For comparison, we have also displayed results from 
numerical simulations with a genetic algorithm. As was to be expected, the 
approximation works well for /i <C 1, but breaks down for higher mutation 
rates. For /i > 1, the approximation significantly underestimates the average 
population fitness. Interestingly, this goes along with a decrease in the population 
neutrality (Fig. lb). What happens is the following. In general, a population 
moves to that region of the genotype space where the probability of neutral 
offspring is maximized for the given mutation rate. Clearly, when the mutation 
rate is low, only the immediate neighbors influence that probability. If, however, 
the mutation rate is high, such that offspring with two or even more mutations 
become common, the immediate neighbors lose their importance. In the extreme 
case of a per symbol copy fidelity of q = 0.5, the probability of giving birth to a 
viable sequence becomes identical for all sequences. Hence, in that extreme we 
can expect the population neutrality to coincide with the network neutrality v 
(the network neutrality is the average number of neutral neighbors of a viable 
sequence), which amounts to v — -^l in our case. At the same time, most of 
the average population fitness does not stem from the one-mutants anymore, 
but from mutants further away, which explains why the approximation equation 
(26) underestimates the true average fitness. 
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repairs A: bits n pairs Abits repairs 

{ A ^ f A 1 f A 1 f A 1 f A ^ 

^...^^...o...oo^...^i^i...ii^...j_g{n J 

block b block 2 block 1 

Figure 2: A valid string in the Neutral Staircase landscape. None of the two active blocks 
(block 1 and 2) contains a pair 11, all k bits between blocks 1 and 2 are set to 1, and all bits 
to the left of block 2 are set to 0. 



4. The Neutral Staircase landscape 

The fitness landscape that we have studied in the previous section contains only a 
single large neutral network, and hence epochal dynamics as predicted in Sec. 2.1 
cannot be observed in that landscape. However, with a small modification, we 
can create a landscape that possesses the required properties. We subdivide a 
binary sequence into b blocks of length 2n, with a set of k bits in between each 
block (Fig. 2). The total length of the sequence is thus / = 2bn + k{b — 1). The 
blocks can be active or inactive. An active block has properties similar to the 
sequences of the previous section. The block is subdivided into n pairs. If any of 
those n pairs are in the configuration 11, the fitness of the whole sequence is zero. 
Otherwise, the fitness is not affected by that block. The rightmost block is always 
active. A block further to the left becomes active if the block immediately to its 
right is active, and all k bits between the two blocks are set to 1. Finally, any 
bit to the left of the leftmost active block that is set to 1 results in fitness zero 
for the whole sequence. We call this landscape the Neutral Staircase landscape, 
in analogy to the Royal Staircase introduced by van Nimwegen and Crutchfield 
[2000b]. Note that the Neutral Staircase differs from the Royal Staircase in an 
important aspect: all sequences have either fitness or fitness 1. No higher fitness 
genotypes can be discovered, and the population's dynamics is determined only 
by the topology of the neutral sequences. 

The analytical treatment of the Neutral Staircase landscape is straightforward 
for k > 1. The neutral sequences decompose into b neutral networks, one for each 
possible number of blocks that can be active. Within one neutral network, the 
average population fitness is readily available from (23). Namely, if % counts the 
number of active blocks, we have 

(/ l ) = ^(l + y + ^V / 8T^) m - (28) 

Likewise, it is simple to derive the probability with which a new block is dis- 
covered. First, we note that the probability with which an offspring sequence 
remains on the neutral network i is given by (z/j) = (/j), according to (11). 
Hence, a single offspring sequence ends up on the next neutral network with 
probability {fi)fi> k q~ 2n (l — q 2 fl 2 ) n (we need k extra mutations to set the k bits 
left of the leftmost active block to one, but we are allowed some miscopies in 
the newly activated block). In a finite population of sequences, there are on 




Figure 3: Two example runs on a Neutral Staircase landscape. If the entropic barrier between 
the neutral networks is large (a), the population moves from one network to the next in discrete 
jumps. If the barrier is small (b), the population moves straight away to the network that yields 
the highest average population fitness. We used n = 3, b = 5, q = 0.99, and a population size 
of N = 10000. In graph a, wc used k = 3, and in graph b, we used k = 2. 



average N(fi) sequences that can give rise to offspring. A sequence that belongs 
to the network % + 1 is therefore created in one generation with probability 

P creaii = 1 - [1 - (fi)fl k q- 2n (l - <?j?) n ] N{fi) ■ (29) 

Next, we are interested in the probability of fixation of that newly discovered 
sequence, 7Tj. For large populations, 7Tj can be approximated by 2s [Haldane, 
1927, Kimura, 1964], where s is the selective advantage of the newly discovered 
network. In our case, we obtain thus 

7r i = 2(/ i+1 )/</ i )-l. (30) 

The probability of a transition from network % to network i + P(i — > % + 1), is 
given by P(i — > i + 1) = P C rca,i^i, and hence the average epoch length Tj follows 

as 

(3D 



P(i — >• % + 1) P C rea,i^i 



The time to convergence r conv , i.e., the time until the optimum sequence distri- 
bution has been found, is given by 



6-1 



5>- ( 32 ) 



k=l 



Equations (28), (31), and (32) capture most of the immediately observable 
quantities in an evolving population in the Neutral Staircase landscape. We have 
tested their applicability with numerical simulations of a genetic algorithm, and 
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Figure 4: Convergence time r conv in the Neutral Staircase landscape. The solid line is the 
analytical expression (32), and the points are derived from 10 independent simulations each. 
The standard error of the measured convergence times is of the size of the symbols. We used 
n = 3, b = 5, k = 3, and a population size of N = 10000. 



have found good agreement. In Fig. 3, we compare the average fitness in the dif- 
ferent epochs, (fi), to the simulation results. Figure 3a shows that the population 
fitness fluctuates around the predicted value during the metastable equilibrium, 
and that transitions between two equilibria happen fast. In Fig. 3b, we have 
made the entropic barrier between two networks smaller (we have reduced k). 
Then, all neutral sequences actually form only a single large neutral network, 
and the population transitions immediately to the highest possible level of the 
average fitness. 

In Fig. 4, we compare the time to convergence r conv with simulation results. 
We find that for fi < 3, our analytical prediction agrees well with the simula- 
tions, whereas for larger fi, the prediction fails to capture the observed behavior. 
The origin of this failure is the following. We have derived equation (32) under 
the assumption that it is hard to discover the next higher network. Now, when 
fi becomes of the order of k (we have k = 3 in Fig. 4), mutations that flip all 
k bits at once to activate the next block become likely, and hence this assump- 
tion fails. As in the case of Fig. 3b, the neutral sequences then form a single 
gigantic network, and an expression derived under the assumption of disjunct 
sub-networks must break down. 

5. Discussion 

The analysis presented in the preceding sections has shown that the topology 
of the neutral sequences in genotype space has an important influence on the 
dynamics of a population. It is therefore not justified to regard the evolution of 
a population of neutral genotypes as a simple diffusion process, unless either the 
population size or the mutation rate are very small (when the product of the 
population size, N and the genomic mutation rate, fi, is much smaller than one, 
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the population as a whole performs essentially a random walk on the neutral 
network [van Nimwegen et al., 1999a].) We have demonstrated that evolution on 
neutral networks can lead to the same kind of epochal dynamics that previously 
was thought to be caused solely by the discovery of genotypes of higher fitness. 
Of course, our results do not imply that sudden transitions in a population 
are never caused by such discoveries. Normally, a transition that we observe in a 
population evolving in an unknown fitness landscape will be due to the discovery 
of a faster replicating genotype. However, sometimes we may observe a transition 
to a higher average fitness without an increase (or even with a decrease, see 
below) in the fitness of the dominant or fastest replicating genotype. Such a 
transition is then due to the effects described in this paper. 

In most cases, selection acts first and foremost on replication rates. However, 
if all viable genotypes are identical in terms of their replication rate, as is the 
case in the fitness landscapes we have studied in this paper, the next important 
quantity selection acts upon is the probability with which genotypes have viable 
offspring. This probability is more a property of a set of genotypes than of a single 
genotype, because a genotype that itself has high reproductive success, but pro- 
duces mainly offspring with poor reproductive success, will ultimately have only 
a small number of progeny. Therefore, the selective pressure we have described 
here acts solely on clouds of mutants, in an extreme form of quasispecies-like 
selection [Eigen and Schuster, 1979, Nowak, 1992]. 

Throughout this paper, we have assumed that all sequences have the same 
replication rate a. It is natural to ask what happens when sequences with dif- 
ferent replication rates are present. The simplest such situation occurs when all 
sequences within a single neutral network i have the same replication rate <7j. 
In that case, we can take the analysis given in Sec. 2.1 one step further. The 
equilibrium fitness (fi) within a network i is then given by 



where pi is the spectral radius of the connection matrix of the network, Gj. A 
population will of course try to move to the particular network that gives the 
maximum equilibrium fitness. For a small mutation rate jl, it is clear that the 
network with the largest <7j yields the maximum equilibrium fitness. If, however, 
the mutation rate is large, such that jxpi becomes of the order of unity or even 
exceeds that value, then depending on the distribution of the p^s and the <7j's 
over the different networks, a population may actually transition from a network 
j with a higher replication rate <jj to a network % with a smaller replication rate 
<7j, while at the same time increasing its average fitness. A similar effect was 
already predicted by Schuster and Swetina [1988], although they investigated 
peaks with different support from slightly deleterious mutations, rather than 
neutral networks with different connection densities. In recent work, Ofria et al. 
[2000] have observed this selection for neutrality empirically in digital organisms. 
In a large number of evolution experiments with different mutation rates, they 




(33) 
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observed that the number of neutral neighbors of the dominant genotype (a 
crude but practical measure for the spectral radius of the connection matrix) 
would increase at higher mutation rates, at the expense of the digital organisms' 
replication rate. Similarly, Wilke et al. [2000] showed that digital organisms with 
a vastly inferior replication rate could outcompete seemingly superior digital 
organisms at high mutation rates if the slower replicating organisms had a higher 
robustness against mutations. 

It is interesting to relate our results for the average population fitness to the 
mutational load L, which is defined as [Haldane, 1937, Muller, 1950, Crow, 1970] 

L=l-^-. (34) 
a 

From equation (9), we obtain 

L = I — q l (l + pp) , (35) 

where p is the spectral radius of the connection matrix G. As was already noted 
by van Nimwegen et al. [1999a], the load can deviate significantly from Haldane's 
result L = p [Haldane, 1937] if neutrality is present [note that equation (35) 
becomes identical to Haldane's result in the absence of neutrality (p = 0) and 
in the limit of a small mutation rate]. More importantly, as a generalization of 
Haldane's result, it is often cited that the genetic load in an asexual population 
is independent of the fitness landscape, and therefore also of epistasis, and that 
it is equal to 

L — l — e _/i . (36) 

(see e.g. Kondrashov [1988], Charlesworth [1990], derivation given by Kimura and 
Maruyama [1966], Crow [1970]). This result, however, holds only in the absence of 
neutral mutants. If neutral mutants are present, then the topology of the neutral 
genotypes in the genotype space is coupled to the mutational load, by virtue of 
p in equation (35). Since the topology is also coupled to the type of epistasis 
that we observe [Wilke and Adami, 2000], epistasis must have an influence on 
the mutational load, even in asexual populations, as long as neutral mutations 
can occur. However, equation (36) is the basis of the deterministic mutational 
hypothesis of the evolution of sex [Kondrashov, 1988]: if the mutational load is 
independent of epistasis for asexual populations, but depends strongly on the 
sign of epistasis for sexual populations, then for certain types of epistasis the 
mutational load of a sexual population may be much smaller than that of an 
asexual population. Because of the differences between equations (35) and (36), 
the deterministic mutational hypothesis could break down if neutrality were 
taken into account. As a consequence, this hypothesis should be reconsidered for 
the case of landscapes with neutrality. 
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6. Conclusions 

We have shown that adaptive evolution can take place in the complete absence 
of what is ordinarily understood as advantageous genotypes. Even if the fitnesses 
of all viable genotypes are completely identical does selection favor particular 
regions of genotype space over others. What makes the difference is the den- 
sity of neutral sequences. In a region of genotype space with a higher density of 
neutral sequences, chances are higher that a mutated offspring is neutral rather 
than deleterious. Therefore, neutral sequences in such a region have a higher ro- 
bustness against mutation, and hence a higher reproductive success. This gives 
them sufficient selective advantage to outcompete sequences from a less densely 
connected region. The transitions between different such regions will often oc- 
cur in sudden jumps, followed by relatively long periods of stasis. Evolution on 
neutral networks alone can thus lead to epochal dynamics observed in so many 
natural and artificial evolving systems. 
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